Wenxuan Zhang

Toward Onboard AI-Enabled Solutions to Space Object Detection for Space Sustainability

May 03, 2025

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Apr 26, 2025

Assessing Judging Bias in Large Reasoning Models: An Empirical Study

Apr 14, 2025

Query-based Knowledge Transfer for Heterogeneous Learning Environments

Apr 12, 2025

FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models

Feb 25, 2025

SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast Asia

Feb 10, 2025

UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping

Jan 10, 2025

Knowledge Boundary of Large Language Models: A Survey

Dec 17, 2024

ClarityEthic: Explainable Moral Judgment Utilizing Contrastive Ethical Insights from Large Language Models

Dec 17, 2024

Sensing for Space Safety and Sustainability: A Deep Learning Approach with Vision Transformers

Dec 12, 2024